Predicting Mortality of COVID-19 Patients based on Data Mining Techniques

J Biomed Phys Eng. 2021 Oct 1;11(5):653-662. doi: 10.31661/jbpe.v0i0.2104-1300. eCollection 2021 Oct.

Abstract

If Coronavirus (COVID-19) is not predicted, managed, and controlled timely, the health systems of any country and their people will face serious problems. Predictive models can be helpful in health resource management and prevent outbreak and death caused by COVID-19. The present study aimed at predicting mortality in patients with COVID-19 based on data mining techniques. To do this study, the mortality factors of COVID-19 patients were first identified based on different studies. These factors were confirmed by specialist physicians. Based on the confirmed factors, the data of COVID-19 patients were extracted from 850 medical records. Decision tree (J48), MLP, KNN, random forest, and SVM data mining models were used for prediction. The models were evaluated based on accuracy, precision, specificity, sensitivity, and the ROC curve. According to the results, the most effective factor used to predict the death of COVID-19 patients was dyspnea. Based on ROC (1.000), accuracy (99.23%), precision (99.74%), sensitivity (98.25%) and specificity (99.84%), the random forest was the best model in predicting of mortality than other models. After the random forest, KNN5, MLP, and J48 models were ranked next, respectively. Data analysis of COVID-19 patients can be a suitable and practical tool for predicting the mortality of these patients. Given the sensitivity of medical science concerning maintaining human life and lack of specialized human resources in the health system, using the proposed models can increase the chances of successful treatment, prevent early death and reduce the costs associated with long treatments for patients, hospitals and the insurance industry.

Keywords: COVID-19; Data Mining; Mortality; Prediction.